Partitioning Input Space for Control-Learning
نویسندگان
چکیده
This paper considers the eeect of input-space partitioning on reinforcement learning for control. In many such learning systems, the input space is partitioned by the system designer. However, input-space partitioning could be learned. Our objective is to compare learned and programmed input-space partitionings in terms of the overall system learning speed and proociency achieved. We present a system for unsupervised control-learning in temporal domains with results for both programmed and learned input-space partitionings. The trailer-backing task is used as an example problem. Many classic control-learning systems, such as Michie and Chambers' BOXES 11] and Barto, Sutton, and Anderson's ASE/ACE system 2], rely on the partitioning of a space of continuous input variables into a xed number of discrete regions. More recent systems, such as Fuzzy BOXES 18], have blurred the boundaries between the input regions but nonetheless rely on a partitioning of the input space. To use these systems, researchers have typically partitioned the space manually, prior to the application of the learning system. In these cases, the designer must analyze the problem to discover a suitable partitioning, or face a trade-oo between a ne partitioning that permits accurate approximation of complex functions or a gross partitioning that allows for rapid learning. The Self-Organizing Neural Network with Eligibility Traces (SONNET) scheme introduced by Hougen 4] is a general paradigm for the construction of connectionist networks that learn to control systems with a temporal component. In order to form mappings from input parameters to output responses, the system discretizes the input space by learning a partitioning of it, and learns an output response for each resulting discrete input region. SONNET systems have separate subsystems for learning input and output. The input subsystem learns input-space partitionings through self-organization and the output subsystem learns responses through the use of eligibility traces. Both input and output learning make use of topological ordering of the neural elements and associated neighborhoods, as in Kohonen's Self-Organizing Topological Feature Maps 8]. 1.1 Topology and neighborhoods Each SONNET subsystem consists of one or more artiicial neural networks. For each network there is a topological ordering of the neural elements that remains constant as the network learns. Each neural element is assigned an integer tuple of the same dimensionality as the network that uniquely deenes its coordinates in topology space. The existence of a network topology allows for the deenition of a distance function for the neural elements. This is …
منابع مشابه
Partitioning Input Space for Reinforcement Learning for Control
This paper considers the effect of input-space partitioning on reinforcement learning for control. In many such learning systems, the input space is partitioned by the system designer. However, input-space partitioning could be learned. Our objective is to compare learned and fixed input-space partitionings in terms of the overall system learning speed and proficiency achieved. We present a sys...
متن کاملModular SRV Reinforcement Learning Architectures for Non-linear Control
This paper demonstrates the advantages of using a hybrid reinforcement–modular neural network architecture for non-linear control. Specifically, the method of ACTION-CRITIC reinforcement learning, modular neural networks, competitive learning and stochastic updating are combined. This provides an architecture able to both support temporal difference learning and probabilistic partitioning of th...
متن کاملInput Space Partitioning for Neural Network Learning
Neural Network (NN) is a supervised machine learning technique, which is typically employed to solve classification problems. When solving a classification problem with the conventional NN, the input data fed into the NN often consists of multiple attributes of various properties. However, training the NN with all of the available input attributes may not lead to the desirable performance consi...
متن کاملNeural Incremental Attribute Learning in Groups
Incremental Attribute Learning (IAL) is a feasible approach for solving high-dimensional pattern recognition problems. It gradually trains features one by one. Previous research indicated that supervised machine learning with input attribute ordering can improve classification results. Moreover, input space partitioning can also effectively reduce the interference among features. This study pro...
متن کاملHierarchical Neuro-Fuzzy Systems Part II
This paper describes a new class of neuro-fuzzy models, called Reinforcement Learning Hierarchical NeuroFuzzy Systems (RL-HNF). These models employ the BSP (Binary Space Partitioning) and Politree partitioning of the input space [Chrysanthou,1992] and have been developed in order to bypass traditional drawbacks of neuro-fuzzy systems: the reduced number of allowed inputs and the poor capacity t...
متن کامل